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Abstract: Carotenoid-protein complex (CPC) was isolated from chromoplast-enriched suspensions of cassava storage 
root (CSR) using size exclusion chromatography and characterized. Peptide sequences (LC_MS/MS spectrum) obtained 
from CPC and their corresponding proteins were obtained using publically available databases. Small Heat Shock Proteins 
(sHSPs) were the most abundant proteins identified in the CPC. Western blot analysis showed that Fribrillin and Or- 
protein were present in chromoplast-enriched suspensions of yellow root but not in the complex or white root. Results 
from qRT-PCR helped identify an isoform of HSP21 possessing four single point mutations in the intense yellow CSR 
that may be responsible for increased sequestration of b-carotene. 
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INTRODUCTION 

Plastid lipid-associated proteins (PAPs) and Carotenoid 
Associated Proteins (CAP) are general terms conferred to 
two classes of proteins accumulated at high levels in chro- 
moplasts of plant cells [1,2] that are involved in the seques- 
tration of hydrophobic compounds including lipids and caro- 
tenoids. These proteins are being investigated by several 
groups to establish the role of their parallel accumulation 
with carotenoids [3] and lipids of different types [4,5]. These 
investigations also include examining chromoplast formation 
[2,3,6], which specially focus on the Fibrillar type that forms 
a carotenoid-protein complex (CPC) within supramolecular 
structures where different carotenoid types are sequestered 
[2,7,8]. Most of this information was obtained from green 
tissues where carotenoid function is well documented in 
dissipating excess excitation energy by participating in non- 
photochemical quenching, which is essential in protecting 
the chloroplasts from photo-oxidative damage [9] and lipid 
changes that occur during leaf senescence due to 
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interconvertion of chloroplasts into chromoplast [5]. Here we 
provide information on the characterization of carotenoid- 
protein complex isolated in cassava storage root that is a non 
green tissue. 

Analysis of sequence data generated by LC-MS/MS is 
one of the classic PROTEOMICs technologies widely 
adopted for sequencing complex protein mixtures and pre- 
dicting individual proteins [10]. Although erroneous protein 
identification is appreciated [11], the identification of indi- 
vidual proteins by LC-MS/MS is largely supported by statis- 
tical scoring systems [12, http://systemsbiology.org/re- 
search/software.html.], which increase our confidence about 
protein sequence identification and homology matches to 
peptide sequences deduced from species specific EST data- 
bases, including the EMBRAPA cassava genome project da- 
tabase [http://genoma.embrapa.br/genoma/], ESTIMA data- 
base [http://titan.biotec.uiuc.edu/cgi-bin/ESTWebsite/-esti- 
ma_start?seqSet=cassava], CIAT data base [http://web- 
app.ciat.cgiar.org/biotechnology/bioin-formatics.htm] , Riken 
full-length cDNA database [http://www.brc.riken.go.jp/lab/ 
epd/Eng/catalog/pDNA.shtml] and the Arabidopsis counter- 
parts (ATG_ORTHOLOGUE) (http://www. arabidopsis. 
org/). These resources and other public databases facilitate 
identification of proteins and there corresponding gene se- 
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quences using validated peptide sequences derived from 
shotgun PROTEOMIC approaches. Here, we report results 
from combining chromoplast-enriched suspension prepara- 
tion, protein-carotenoid complex separation in conventional 
size exclusion chromatography (SEC), protein separation by 
SDS-PAGE, and shotgun PROTEOMIC sequence tech- 
niques to identify proteins from carotenoid-protein complex 
present in cassava storage root. 

MATERIALS AND METHODOLOGY 

Plant Material and Tissue Preparation 

Cassava Storage Root (CSR) from field-grown plants at 
EMBRAPA Genetic Resources and Biotechnology Research 
Unit (Brasilia, DF. Brazil) were used in this study. For pro- 
tein extraction, quantification and protein blot analysis we 
used landrace BGM019 (intense yellow root) and the cv. 
IAC12.829 (white root). Protein sequence from carotenoid- 
protein complex was obtained from landrace BGM019. For 
gene expression analysis we used landrace BGM019 and cv. 
IAC12-829. Fresh storage roots (cylinders of 30-40 cm 
length x 4-6 cm diameters) were manually sliced and imme- 
diately frozen in liquid nitrogen, freeze-dried and stored at - 
80°C until used. For the chromoplast-enriched suspension 
studies, fresh intact storage roots were peeled off and proc- 
essed immediately after harvest. 

Protein Extraction, Quantification and Blot Analysis 

One gram of freeze-dried CSR tissue was rehydrated and 
added with 2 ml of extraction buffer (pH 6.8, 5% SDS, 10% 
glycerol, 80 mM Tris, and 25 mM DTT), and 20 ml acetone 
and incubated at -20°C for more than lh to obtain a protein 
precipitate free of pigments. After centrifugation (30000 rpm 
@ 4°C for 20 min) the supernatant was drained, the pellet 
dried by blowing N 2 air and suspended with 5 ml extraction 
buffer (EB). The re- suspended solution was centrifuged 
again (30000 rpm @ 4°C for 10 min) to collect 1ml aliquot 
of the supernatant that was stored at -80°C until use. Ali- 
quots (100 jliI) were treated with DOC (0.15%) and TCA 
(12%) to precipitate soluble proteins. After centrifugation 
(13000 rpm @ 4°C for 20 min) the pellet was suspended in 
100 jliI of AB, and 50 jul was used for tissue protein estima- 
tion by the BCA Micro Kit method (PIERCE) according to 
the manufacturer protocol. Proteins from crude extract (CE), 
peakl and 2 (30 |ug each) were separated by 12% SDS- 
PAGE and blotted onto nitrocellulose as previously de- 
scribed [13]. Transblots were blocked with nonfat dry milk 
and incubated with primary antibody to Or-protein or Fi- 
brillin-protein at a 1:200 dilution and secondary AP- 
conjugated goat-anti-rabbit IgG antibody (Bio-Rad) at 
1:3000 dilution. Western Blue Stabilized Substrate 
(Promega) was used for chromogenic detection. 

Preparation of Chromoplast Enriched Suspensions 

Fresh CSR was harvested, washed with tap water, sliced, 
and homogenized with Homogenate Buffer (HB) adjusted to 
pH 8.2 (100 mM Tris-HCl, 8 mM EDTA, 10 mM KC1, 2 
mM MgCl 2 , 400 mM sucrose, and 1 mM PMSF) using a 
household blender to obtain a paste that was filtered through 



The Open Biochemistry Journal, 2012, Volume 6 117 

three layers of cheesecloth. The filtrate was centrifuged (600 
rpm @ 4°C for 30 min) and the resulting supernatant was 
centrifuged (2000 rpm @ 4°C for 30 min) a second time. The 
resulting pellet was suspended by brushing the surface with 
50 mL HB, washed once with HB, followed by centrifuga- 
tion (2000 rpm @ 4°C for 30 min). The final pellet was col- 
lected by surface brushing with 50 mL of suspension buffer 
(SB) adjusted to pH 6.8 (100 mM Tris-HCl, and 250 mM 
NaCl), named Non-denatured Chromoplast-Enriched Sus- 
pension (NDCES), and used for separation of carotenoid- 
protein complex and non-carotenoid associated proteins by 
Size Exclusion Chromatography (SEC). 

Size Exclusion Chromatography of NDCES 

Chromoplast-enriched suspensions were sonicated for 10 
min and 5 mL was loaded onto a column (1 m length x 1.5 
cm internal diameter) filled with a resin-bed of Sepharose 
CL 6B-200 and equilibrated with SB. Fractions (1 mL) were 
collected at a flow rate of 1.5 mL min" 1 and OD readings @ 
280 nm and 461 nm were recorded using a spectrophotome- 
ter (Spectra max Plus_384, Molecular Devices) using UV 
transparent microplate. Fractions from two observed peaks 
were pooled separately, named Peak 1 (fractions 33-49, 
carotenoid-protein complexc - CPC) and Peak 2 (fractions 
108-122, non carotenoid-protein complex - NCPC), concen- 
trated, and stored at -80°C until further analyzes. 

HPLC Analysis of Peak 1 Carotenoids 

Pooled fractions 33-49 from SEC were treated with ace- 
tone to isolate the carotenoid constituent, which was further 
separated on a C30 column according to [14] to identify the 
composition of carotenoids present in Peak 1. 

Protein Preparation for LC_MS/MS Analysis 

Pooled protein fractions from peaks 1 and 2 were sepa- 
rated by one-dimensional SDS_PAGE as previously de- 
scribed [10,15,16] and gels zones were excised and used for 
in-gel protein digestion with trypsin. Equivalent protein 
amount (100 mg) from both peak 1 and 2 were individually 
transferred to a siliconized eppendorf tube and dried in a 
speed- vac devise. The dried pellet was re- suspended in 200 
mL of TCEP solution (10 mM) prepared fresh in ambicard 
and incubated at 56°C for one hour and cooled to room tem- 
perature. Proteins were then precipitated with 4 volume of 
acetone after incubation at -80°C for one hour. After cen- 
trifugation at maximum speed for 30 min at 4°C, the pellet 
was collected and added to 200 mL of Iodacetamide (55 
mM) in ambicarb, and incubated for one hour at room tem- 
perature in the dark. The sample was incubated at -80°C for 
one hour with 4 volumes acetone and then centrifuged 
(maximum speed @ 4°C for 30 min). The resulting pellet 
was incubated with 100 mL of ambicarb by vortexing and 
sonicating for one hour. A trypsin solution (20 mg mL" 1 in 
ambicarb) was added to have a protease :protein ration of 
1:50, incubated at 37°C for the first two hours, after which 
an additional 100 mL of the same trypsin solution was added 
for incubation over night. After cooling to room temperature 
the sample was used for mass spectrometry analysis. 
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LC-MS/MS Analysis 

After initial protein digestion, peptides were ionized, 
detected, isolated, fragmented and sequenced in a qTOF 
MALDI MS/MS. The operating software generated MS and 
MS/MS spectra, and the peptide sequences were determined 
using the PROTEOMIC facilities and software available at 
USD A/ARS -National Animal Disease Center in Ames, 
Iowa. 

Bioinformatics Analysis of Peptide and Protein Sequence 
Data 

Peptide sequences were established using a MASCOT 
server database (http://www.matrixscience.com/cgi/search_ 
form.pl?FORMVER=2&SEARCH=PMF) and statistical 
validation as well as protein sequence matching identifica- 
tion. Peptide sequences that produced no protein hits were 
further analyzed using tblastx and translation tools available 
in ExPASy at Proteomics Server (http://www.expasy.org/) 
using four cassava EST databases located at CIAT 
(http://webapp.ciat.cgiar.org/biotechnology/bioinformatics.ht 
m), ESTIMA (http://titan.biotec.uiuc.edu/cgi-bin/ESTWeb- 
site/estima_start?seqSet=cassava), EMBRAPA (http://www. 
genoma.embrapa.br/), and RIKEN (http://www.brc.nken. 
go.jp/lab/epd/Eng/species/manihot.shtml). Finally, the pro- 
tein fragment sequences derived from the EST matching 
results were mapped to an Arabidopsis protein database 
[http://www.arabidopsis.org] and to cassava PROTEOME 
and GENOME databases [http://www.phytozome.net/cas- 
sava]. Information on cassava proteome loci, for identified 
protein, and genome scaffold (for their corresponding gene) 
were obtained to characterize their location and distribution 
of identified proteins in the carotenoid-protein complex. 

RNA Isolation, Quantification, and cDNA Synthesis 

Total RNA was extracted twice using the phenol chloro- 
form procedure described previously [17] and treated with 
RNAse free DNAse. Total RNA was quantified using the 
Quant-It RiboGreen RNA Kit according to the manufacturer 
(Molecular Probe). Total RNA (400 ng) was reverse tran- 
scribed in a 20 juL reaction volume using a Superscript III 
Platinum Two- Step qRT-PCR Kit (Invitrogen). The reac- 
tions were terminated by heat inactivation and the cDNA 
products were treated with RNase H and stored at -20°C. 

Gene-Specific Primers, Certified Standard and House- 
keeping Gene Primers 

Fluorogenic primers (FAM labeled LUX primer) and 
corresponding unlabeled primers were designed using the 
LUX Designer-Desktop version (Invitrogen) for each se- 
quence of the cDNA fragment coding for genes as shown in 
Table SI (see supplementary data). Standard certified prim- 
ers for 18S ribosomal (Invitrogen Cat. 115HM-02) and Gus 
(Invitrogen Cat. 112H-02), and qPCR plasmid Standards 
(Invitrogen Cat. 11741-100) with Gus ORF were used as 
internal controls and quantitative standards to generate stan- 
dard curves respectively. All primers were synthesized and 
purchased from Invitrogen. 



Quantitative Real Time PCR Analysis 

Gene sequences coding for proteins identified from the 
carotenoid-protein complexes were used to quantify tran- 
script abundance in white and intense yellow CSR using 
(qRT-PCR). The qRT-PCR assays were performed in tripli- 
cate for each extraction using a Bio-Rad system (BioRad 
model iCycler). Ribosomal RNA (18S) was used as an inter- 
nal control and PCR efficiency was evaluated using GUS 
certified primers to obtain a quantitative standard curve for 
absolute expression analysis of target transcript level in sam- 
ples. Each experiment was replicated at least three times. 
Alternatively, a relative expression assay using MEC 1 (gene 
coding for Pt2L4 protein, 17) was used as an internal control. 
qRT_PCR products were sequenced at the EMBRAPA Ge- 
netic Resources and Biotechnology Genome facilities. 

Data Analysis 

To access the intrinsic property for the level of each 
transcript corresponding to the target gene in all samples, 
qRT-PCR amplification procedures consisted of absolute 
quantification expression using total RNA as normalizer. 
These methods provided an absolute quantitative standard 
curve for a dilution series based on the qPCR plasmid stan- 
dards kit protocol (Invitrogene). Preliminary experiments 
with two unknown samples were performed to set up qRT- 
PCR amplification optimization conditions, including reli- 
able exponential phase of amplification, qRT-PCR efficiency 
ranging from 90% to 105%, Ct threshold and baseline, and 
evaluation of each primer sets for target gene within an un- 
known sample. After setting up the optimized conditions, Ct 
values for all unknown samples and replications were inter- 
polated from the standard curve estimating the level of tran- 
script in unknown samples. In addition, a five point curve 
made with each amplicon was performed to obtain the effi- 
ciency of each primer (needed for the AAct formula) and to 
know the range of amplification. After setting up the opti- 
mized conditions, Ct values for all unknown samples and 
replications were collected and normalized with housekeep- 
ing genes (internal controls) using the AAct formula. Statisti- 
cal parameters, such as mean and mean standard deviation, 
were performed in Excel. 

RESULTS 

Carotenoid-protein Complex Isolation and Characteriza- 
tion 

Proteins from non-denatured chromoplast-enriched sus- 
pension (CES) present in white and intense yellow cassava, 
containing chromoplast- associated proteins (CAP) were 
separated by SEC Fig. (1A and IB), respectively. 

Size exclusion chromatography profiles monitored with 
Abs 46 i (carotenoids) and Abs 2 so (proteins) indicated the pres- 
ence of two major peaks. Peak 1 (fractions 33-49) eluted 
within the void volume of the column, as determined with 
blue dextran (MW 2000 kDa) and overlap between the caro- 
tenoid and protein profiles indicates that Peak 1 represents a 
carotenoid-protein complex (CPC) with an estimated size 
close to 400 KDa. Peak 2 (fractions 108-122) eluted later in 
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the SEC profile and no observed overlap between carote- 
noids and proteins in Peak 2 suggests it represents non- 
carotenoid-associated proteins (NCPC). Proteins present in 
CPC and NCPC were separated by SDS-PAGE to visualize 
protein complexity and to fractionate proteins for 
LC_MS/MS analysis. From the gel image (see Supplemen- 
tary data complementing Table S2 and S5), a greater number 
of proteins were observed in fractions forming NCPC than in 
fractions forming CPC. Peptide and protein sequences from 
Peaks 1 and 2 identified 83 and 106 peptides that assembled 
into 34 and 38 proteins respectively. Carotenoids from 
pooled fractions of CPC, separated by HPLC, indicate that P- 
carotene is the major carotenoid of the CPC isolated from 
both white (Fig. 2 A) and intense yellow cassava (Fig. 2B) 
phenotypes. These results confirm the presence of carote- 
noid-protein complexes in cassava storage root that is similar 
to the one reported for carrot root [18-20] in terms of size, 
but different in the type of carotenoid and proteins types that 
includes P-carotene, a-carotene, and lutein as the major 
carotenoids and a single major protein of 18kDa in the CPC 
of carrot. 



Peptide Sequence and Protein Identification in Chromo- 
plast Enriched Suspension (CES) 

Reverse-phase capillary chromatography (capillary LC) 
with quadrupole time-of-flight tandem spectrometry (nano- 
ESI Q-TOF MS/MS) was used with in gel tryptic digestion 
of proteins from CPC and NCPC. The LC-MS/MS peptide 
sequences were validated by MASCOT (http://www. matrix- 
science.com/cgi/search_form.pl?FORMVER=2&SEARCH= 
PMF) and used to identify proteins. The proteins sequences 
identified in CPC and NCPC are listed in the Supplementary 
data files in Table S2 and Table S5. A total of 65 proteins 
were identified, with 39 identified in NCPC and 26 in CPC. 
The majority of the proteins identified in both CPC (Fig. 3 A) 
and NCPC (Fig. 3B) are heat shock proteins (HSP) that are 
differentially distributed in the two fractions , which are the 
focus of discussion in this work. Out of 10 HSPs observed in 
both CPC and NCPC, four of them are present only in CPC 
(HSP15.7, HSP17.4I, HSP18.1I, and HSP26.5), four are 
present only in NCPC (HSP17.4, HSP17.6II, HSP26.5I and 
HSP70.1) and the remaining six are present in both fractions. 
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Fig. (2). Carotenoid HPLC elution profile from the carotenoid-proteins complex (CPC) isolated by SEC as shown in Fig. (1). (A) spectrum 
from white cassava storage root. (B) Spectrum from intense yellow cassava storage root. (1) is Cis 9 P-carotene, (2) is All E p-carotene, (3) 
is Cis 13 P-carotene separated in a C30 column and identified by using external p-carotene standard, p-carotene absorption spectrum and 
retention time. 
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This differential distribution of HSPs between these two 
fractions may indicate (1) differential properties of these 
HSPs to complex with carotenoids, (2) differential solubility 
in the SDS-PAGE gel separation procedure used in the pre- 
sent work, or (3) aggregation dynamics of a particular HSP 
in vitro or in vivo as reported by others [21]. 

Occurrence of Fibrillin- and Or-protein in White and 
Intense Yellow Cassava 

Protein-blot analysis was used to test for the presence of 
Fibrillin- and Or-proteins as markers for fibrillar structure of 
chromoplast occurrence in CSR as suggested in others plant 
systems [7,22], as well as chromoplast formation and differ- 
entiation [23,24] respectively. Fibrillin- (Fig. 4A) and Or- 
protein (Fig. 4B) are present in CES and NCPC of intense 
yellow cassava, but not in CPC isolated fractions or CES of 
the white cassava. These results suggest that the type of 
chromoplast structure accumulating carotenoid in cassava is 
of fibrillar-type . Since Fibrillin- and Or-proteins were not 
detected in the CES or in CPC of white cassava, these results 
also indicate that intense yellow root accumulate more 
chromoplasts than white roots of cassava. At the same time, 
these results suggest that further microscopic studies will be 
needed to confirm the form, number and the behavior of 
chromoplast in CSR, which was not addressed in the present 
work. 

Assign Assessed CAP Sequences to Cassava PRO- 
TEOME and GENOME 

List of peptide and identified proteins sequences present 
in CPC are showed in Supplementary data in Table S2 and 
Supplementary data in Table S3 respectively. List of peptide 
and identified proteins sequences present in NCPC are 
showed in Supplementary data in Table S4 and Supplemen- 
tary data in Table S5 respectively. The identified proteins 
were assigned to cassava PROTEOME and GENOME data- 
bases and results are summarized in Tables 1 to 4. 

It was observed that 30 identified proteins in CPC were 
distributed in 28 cassava-protein loci and 19 families while 
33 proteins in NCPC are distributed in 44 loci and 25 fami- 
lies. The HSP group individuals for CPC are located in 9 loci 
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Fig. (4). Western blot analysis of proteins (35|ug/lane) extracts 
using antiserum raised against Fibrillin- (A) and Or-protein (B). 
Line 1 is chromoplast enriched suspension. Line 2 is total protein 
fraction from CPC. Line 3 is total protein fraction from NCAP 
identified in the profile of size exclusion chromatography as shown 
in Fig. (1). 
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Table 1. Identified Proteins in the Carotenoid-Protein Complex and their Distribution in the Cassava Genome 



Cassava Genome Scaffold 


Score 


E_Value 


Protein Name Abbreviation 


scaffold06948 


971.5 


0 


ACT7 


scaffold09150 


836.2 


0 


Actin 


scaffoldl0689 


291.6 


4.9e-77 


Alpha_SNAP2 


scaffold04718 


513.4 


6.9e-144 


ATGRP7 


scaffold03604 


1004.0 


0 


ATPase 


scaffold04355 


921.0 


0 


ATPase 


scaffold03264 


73.4 


2.7e-ll 


ATPsyn 


scaffold04355 


161.8 


4.5e-38 


ATPsyn 


scaffold04355 


248.3 


l.le-63 


ATRZ-1A 


scaffold06708 


629.8 


3.4e-179 


BE 


scaffold08729 


132.9 


1.8e-29 


COX2 


scaffold06028 


499.0 


1.8e-139 


Cyto_cl 


scaffold06697 


502.6 


1.8e-140 


Cyto_cl 


scaffold05875 


122.1 


6.8e-26 


Dihy_acet 


scaffold06708 


747.9 


0 


ERD10 


scaffold08053 


302.4 


2.2e-80 


HIR1 


scaffoldl2794 


261.0 


6.6e-68 


HIR1 


scaffold06656 


244.7 


4.4e-63 


Histone 


scaffold05875 


731.7 


0 


HSP_17.4I 


scaffold06512 


1292.5 


0 


HSP_17.6I 


scaffold06908 


949.9 


0 


HSP_17.6I 


scaffold06512 


868.7 


0 


HSP_18.1I 


scaffold06908 


1323.2 


0 


HSP_18.1I 


scaffold06512 


868.7 


0 


HSP_18.2 


scaffold06908 


1323.2 


0 


HSP_18.2 


scaffold08265 


1545.0 


0 


HSP_18.2 


scaffold06512 


868.7 


0 


HSP_20 


scaffold07543 


1041.8 


0 


HSP_21 


scaffold09151 


1204.1 


0 


HSP_22.0 


scaffold06701 


1162.7 


0 


HSP_23.6 


scaffold04151 


881.3 


0 


HSP_26.5 


scaffold04355 


161.8 


4.5e-38 


HSP17.6 


scaffold06512 


358.4 


4.0e-97 


HSP20 


scaffold06908 


949.9 


0 


HSP20 


scaffold08265 


250.2 


3.0e-65 


HSP20 


scaffold00069 


978.7 


0 


HSP21 
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Cassava frPnomf 1 Sraffnlrl 


Score 


E Value 


Protein Nsnic Abbreviation 


scarrolaUo/Ul 


1162.7 


0 


HSF23.0 


scarrolaU22o4 


149.2 


2.3e-34 


MAB1 


scaliold00069 


1038.2 


0 


NADH 


scairolalUUw 


468.4 


3.2e-130 


OEP16 


scairolaU7528 


1402.5 


0 


Oxireductase 


scairolaU/ojl 


834.4 


0 


PATL1 


scairolaU6782 


573.0 


2.2e-161 


TV 1\ /TTMT1 

PMDH2 


scaiioiuuouoy 


71 q n 


u 


KJjr j 1 


scaffold00325 


1031.0 


0 


RRM_1 


scaffold07851 


383.6 


8.2e-105 


SEC14 


scaffold02892 


798.4 


0 


snRNP 


scaffoldl2794 


1123.0 


0 


snRNP 



Table 2. Identified Proteins in the Carotenoid-Protein Complex Assigned to Cassava Proteome Locus and their Distribution in 
Protein Families Assigned by Pfam Search 



Pfam 


Manihot Locus 


Protein Name Abbreviation 


Score 


E_ Value 


PF00022 


cassava4.1_033108m 


ACT7 


312.0 


4.8e-85 


PF00022 


cassava4.1_033108m 


Actin 


350.5 


l.le-96 


PF00515 


cassava4. 1_01 3 1 35m 


ASNAP2 


367.5 


2.8e-102 


PF10559 


cassava4.1_006793m 


ATPase 


194.5 


3.5e-50 


PF00306 


cassava4. l_004726m 


ATPsyn 


228.0 


3.2e-60 


PF02806 


cassava4.1_001595m 


BE 


1726.8 


0 


PF02167 


cassava4.1_012354m 


Cyt_cl 


425.6 


l.le-119 


PF00198 


cassava4. l_004864m 


Dihy_acet 


440.3 


5.2e-124 


PF00257 


cassava4.1_015875m 


ERD10 


116.7 


7.3e-27 


PF04716 


cassava4.1_017840m 


RBP31 


336.7 


1.9e-92 


PF01145 


cassava4.1_013221m 


HIR1 


252.7 


6.6e-68 


PF00125 


cassava4. 1_0 1 99 1 1 m 


Histone 


167.5 


2.1e-42 


PF00011 


cassava4.1_019659m 


HSP_15.7 


225.7 


1.6e-59 


PF00011 


cassava4.1_018038m 


HSP_17.4I 


164.1 


1.5e-61 


PF00011 


cassava4.1_018127m 


HSP_17.6I 


261.2 


3.8e-70 


PF00011 


cassava4.1_017871m 


HSP_18.1I 


291.2 


3.8e-79 


PF04969 


cassava4.1_017974m 


HSP_18.2 


266.9 


9.5e-72 


PF00011 


cassava4.1_017871m 


HSP_20 


284.3 


2.7e-77 


PF00011 


cassava4.1_015256m 


HSP_21 


396.4 


l.le-110 


PF04969 


cassava4.1_032436m 


HSP_22.0 


352.8 


9.6e-98 


PF00011 


cassava4.1_016267m 


HSP_23.6 


382.9 


1.3e-106 
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Pfam 


JVIiuiihot Locus 


Protein Name Abbreviation 


Score 


E Value 


PFUUU11 


cassava4.1_U15518m 


HSP_2o.j 


305.8 


2.3e-83 


n™ i /no 

PFU1423 


cassava4.1_(J1995om 


snRJNr 


58.9 


2.8e-9 


PF02779 


cas sava4 . 1 _0 1 04 1 5 m 


MAB1 


237.3 


1.6e-63 


PF02466 


_ , _ ., , _ A 1 Ai OC1 O _ 

cassava4. 1_U1 85 1 3m 


OEP16 


256.1 


9.4e-69 


rrUulu / 


cassava4. 1_03 1433m 


Oxireductase 


j 1 1 . j 


^.oe-iuj 


PF02866 


cassava4.1_010611m 


PMDH2 


612.1 


0 


PF00076 


cassava4.1_012182m 


RNP 


304.7 


6.5e-83 


PF00076 


cassava4.1_019294m 


RRM_1 


161.4 


4.5e-40 


PF03765 


cassava4.1_003845m 


SEC 14 


309.3 


6.2e-85 



Table 3. Identified Proteins in the Non-Carotenoid-Protein Complex and their Distribution in the Cassava Genome 



Cassava Scaffold 


Total Score 


E_ Value 


Protein Name Abbreviation 


scaffold09150 


839.9 


0 


ACT7 


scaffold03602 


206.9 


2.2e-51 


ACT8 


scaffold03942 


917.4 


0 


AGP8 


scaffold06708 


482.8 


1.0e-134 


Amylase 


scaffold00506 


398.0 


8.5e-109 


APX2 


scaffold06844 


1067.1 


0 


APX2 


scaffold00069 


978.7 


0 


Aspartyl protease 


scaffold00847 


906.6 


0 


ATMAPR2 


scaffold06708 


482.8 


1.0e-134 


BE 


scaffold00506 


398.0 


8.5e-109 


CPN20 


scaffoldl0114 


462.9 


2.3e-128 


CPN20 


scaffold04285 


621.6 


2.8e-176 


CSD1 


scaffold06705 


1104.9 


0 


CSD1 


scaffold06844 


1067.1 


0 


ERD10 


scaffold04953 


627.1 


1.3e-177 


GAPC 


scaffold06948 


720.8 


0 


GAPC 


scaffold07543 


122.1 


6.2e-26 


GAPC 


scaffold03245 


145.6 


6.2e-33 


GAPC 


scaffold07660 


356.5 


1.3e-96 


Glutaredoxin 


scaffold07762 


1103.1 


0 


Glycine-rich protein 


scaffold05255 


919.2 


0 


HSP17.4 


scaffold03651 


1106.8 


0 


HSP17.4 


scaffold01127 


1005.8 


0 


HSP17.4 


scaffold07859 


1440.4 


0 


HSP17.4 


scaffold06908 


502.3 


4.3e-141 


HSP17.4 
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Cassava Scaffold 


Total Score 


E Value 


Protein Name Abbreviation 


scaffold 1U469 


1287.1 


0 


TJCril H A 

HSP17.4 


scaffold 12439 


982.3 


0 


TJCDI H A 

HSP17.4 


scarf oldU9151 


1204.1 


0 


HSP17.4 


X'X' i ia/:c 1 o 

scarroldU6512 


1224.0 


0 


HSP17.61 


XV? „ 1 JACIf C 

scarfoldU5255 


919.2 


0 


HSP17.61 


scaffold 10469 


1287.1 


0 


HSP 17.611 


xv„ i jmo cr» 

scarfold07859 


1440.4 


0 


HSP 17.611 


scattoldU69U8 


1285.3 


0 


Tjcni o o 
HSP18.2 


scarf olaU5 5/5 


731.7 


0 


Tjcni o o 
HSP18.2 


scaffolaU5255 


919.2 


0 


1 i c r> i o o 

HSP18.2 


xv?^ uii^in 

scaffold 12439 


1561.2 


0 


HSP18.2 


XV? „ 1 JA01£C 

scarfold08265 


1545.0 


0 


TTOA1 O <"> 

HSP18.2 


X'X' 1 JA£AAO 

scarfold06908 


1285.3 


0 


TTOA1 O <"> 

HSP18.2 


X'X' 1 jA/;r 1 o 

scarrold06512 


392.6 


2.7e-107 


HSP20 


rv- 1 JA01£C 

scarfold08265 


250.2 


3.0e-65 


HSP20 


scartoldUUU69 


1038.2 


0 


HSP21 


scattola(J3241 


872.3 


0 


HSP21 


scattoldUU467 


596.4 


l.le-168 


HSP21 


XV? _ i j rvo on 

scatfold02817 


1178.9 


0 


HSP21 


X'X' 1 JA^AAO 

scarfold06908 


502.3 


4.3e-141 


HSP22.0 


£f„1 JAA1 C1 

scarf old091 51 


1204.1 


0 


HSP22.0 


X'X'„ 1 J f\f\f\/lC\ 

scarfold00069 


978.7 


0 


HSP23.6 


X'X'„ 1 JA^C 1 O 

scarrold06512 


868.7 


0 


HSP23.6 


scarf oldU41 51 


881.3 


0 


HSP26.51 


scattoldU8265 


571.2 


3.8e-161 


HSP7U-1 


scattoldU8265 


250.2 


3.0e-65 


ATA T~iT> O 

JV1APR2 


X'X'„ 1 JA^ 1 7f 

scarrold04175 


419.7 


1.0e-115 


NAD binding 


X'X' 1 JA^CAO 

scarfold06598 


378.2 


3.4e-103 


Pepsin A 


X'X' 1 J/"\zTO A A 

scarfold06844 


1067.1 


0 


Pt2L4 


x'X' i jmo ca 

scarfold07859 


1440.4 


0 


Pt2L4 


scaffold 11 63 8 


1393.5 


0 


ROC1 


scartoldU8559 


903.0 


0 


Stress-inducible 


SCaiiOlQUU^tO / 


^90 7 


^ 1p 1 Afs 
0. 1c- l^tO 


Subtilase 


scaffoldll661 


414.3 


3.4e-114 


SUM2 


scaffold00847 


780.3 


0 


TPX1 


scaffoldl0669 


699.2 


0 


UBQ10 


scaffoldl2498 


558.5 


2.3e-157 


UBQ2 



Carotenoid-protein Complexes in Cassava Storage Root 
Table 3. Contd 



The Open Biochemistry Journal, 2012, Volume 6 125 



Cassava Scaffold 


Total Score 


E_Value 


Protein Name Abbreviation 


scaffold03055 


1043.6 


0 


UBQ2 


scaffoldl0669 


699.2 


0 


UBQ3 


scaffold00069 


978.7 


0 


UBQ6 


scaffold07591 


749.7 


0 


Unkown 



Table 4. Identified Proteins in Non-carotenoid-protein Complex Assigned to Cassava Proteome Locus and their Distribution in 
Protein Families Assigned by Pfam Search 



Pfam 


Cassava Locus 


Protein Name Abbreviation 


Total Score 


E_Value 


PF00022 


cassava4. 1_033 1 08m 


ACT7 


371.3 


3.4e-103 


PF00022 


cassava4.1_033108m 


ACT8 


448.7 


1.5e-126 


PF02806 


cassava4.1_001595m 


AMYLASE 


255.8 


6.2e-69 


PF00141 


cassava4.1_0 14643m 


APX2 


484.6 


3.8e-137 


PF03489 


cassava4. l_005735m 


Aspartyl protease 


517.7 


4.6e-147 


PF00173 


cassava4.1_0 19974m 


ATMAPR2 


203.8 


6.6e-53 


PF00166 


cassava4.1_0 14439m 


CPN20 


461.5 


3.2e-130 


PF00080 


cassava4.1_0 18289m 


CSD1 


278.9 


1.8e-75 


PF00257 


cassava4.1_015875m 


ERD10 


188.7 


3.7e-48 


PF02469 


cassava4. 1_03 1 63 1 m 


Fasciclin 


323.2 


4.5e-89 


PF00044 


cassava4.1_01 1175m 


GAPC 


402.5 


1.6e-112 


PF00044 


cassava4 . 1 _0 111 76m 


GAPC 


287.0 


6.3e-78 


PF00044 


cassava4.1_01 1326m 


GAPC 


394.4 


2.9e-110 


PF00044 


cassava4.1_01 1366m 


GAPC 


634.4 


0 


PF00462 


cassava4.1_0 19777m 


glutaredoxin 


203.0 


7.9e-53 


PF01277 


cassava4 . 1 _0 16619m 


glycine-rich protein 


243.4 


7.0e-65 


PF00011 


cassava4.1_015256m 


HSP17.4 


403.3 


9.5e-113 


PF00011 


cassava4.1_016281m 


HSP17.4 


302.8 


1.5e-82 


PF04969 


cassava4.1_0 17267m 


HSP17.4 


295.0 


9.6e-80 


PF00011 


cassava4.1_017871m 


HSP17.4 


291.2 


3.8e-79 


PF00011 


cassava4.1_0 18066m 


HSP17.4 


241.1 


2.4e-64 


PF00011 


cassava4.1_01 8093m 


HSP17.4 


281.6 


3.8e-76 


PF00011 


cassava4.1_018121m 


HSP17.4 


305.4 


1.4e-83 


PF00011 


cassava4.1_018127m 


HSP17.4 


261.2 


4.0e-70 


PF00011 


cassava4.1_018158m 


HSP17.4 


267.7 


5.5e-72 


PF00011 


cassava4. 1_01 8376m 


HSP17.4 


295.8 


1.0e-80 


PF00011 


cassava4.1_017871m 


HSP17.6I 


291.2 


3.8e-79 


PF00011 


cassava4.1_018127m 


HSP17.6I 


261.2 


3.8e-70 


PF00011 


cassava4.1_018131m 


HSP17.6I 


167.2 


2.1e-56 
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Pfam 




Protein Name Abbreviation 


Total Score 


E Value 


nTTAAH 1 1 

PPUUU11 


cassava4 . 1 _U 1 8 1 5 8m 


HSP17.611 


267.7 


5.5e-72 


PPUUU11 


cassava4.1_U17871m 


Tjcni o o 

HSP18.2 


291.2 


3.8e-79 


TYr~7A aa i i 

PFUUU11 


cassava4.1_0 18038m 


i inn i o o 

HSP18.2 


164.1 


1.5e-61 


PPUUU11 


cassava4.1_U18U66m 


HSP18.2 


241.1 


2.4e-64 


PPUUU11 


_ , _ ., , _ /I 1 A 1 TOT 1 _ 

cassava4.1_U17871m 


HSP2U 


291.2 


3.8e-79 


PPUUU11 


_ , _ ., , _ /I 1 A 1 o 1 O 1 

cassava4.1_U18131m 


HSP2U 


235.0 


3.7e-62 


PPUUU11 


cassava4 . 1 _U 1 5 25 om 


HSP21 


396.4 


l.le-110 


DCAAA1 1 

PPUUU11 


cassava4.1_U18U66m 


HSP21 


241.1 


2.4e-64 


DCAAA1 1 

PrUUUll 


cassava4.1_U17871m 


lie no o f\ 

HSP22.U 


291.2 


3.8e-79 


DCAAA1 1 

PrUUUll 


cassava4.1_U18U66m 


lie no o rv 

HSP22.U 


241.1 


2.4e-64 


PrUUUll 


cassava4 . 1 _U 1 5 25 6m 


HSP23.6 


396.4 


l.le-110 


PrUUUll 


cassava4.1_U15518m 


ttptmi z: 

HSP23.6 


305.8 


2.3e-83 


PPU4969 


_ _ ., , _ A 1 A1 TO^T ^ 

cassava4.1_U17267m 


ttptmi z: 

HSP23.6 


295.0 


9.6e-80 


PPU4969 


_ _ ., , _ /I 1 a 1 tat /i 

cassava4 . 1 _U 1 7 974m 


HSP23.6 


266.9 


9.5e-72 


PPU4969 


cassava4. l_U22774m 


HSP23.6 


226.9 


l.le-59 


PPU4969 


cassava4. l_032436m 


ucmo z: 

HSP23.6 


352.8 


9.6e-98 


DT7AAA 1 O 

PrUUUlz 


cassava4.1_UU3331m 


ucmA 1 

HSP7U-1 


194.5 


9.2e-59 


T1T7AAA 1 O 

PrUUUlz 


cassava4.1_UU3343m 


ucmA 1 

HSP7U-1 


379.0 


3.0e-105 


PPU5193 


cassava4.1_005898m 


pepsin A 


274.2 


1.3e-73 


UJNKOWN 


_ _ ., , _ A 1 A 1 t /ion ^ 

cassava4.1_U17439m 


Pt2L4 


57.0 


1.8e-8 


PrUUloU 


_ _ ., , _ A 1 A 1 / ^ _ 

cas sava4 . 1 _U 17662m 


ROC1 


323.9 


6.3e-89 


PPUU515 


cassava4 . 1_00444 1 m 


stress -inducible 


305.8 


1.3e-83 


PPU5922 


_ , _ ., , _ A 1 A 1 A 1 AT ^ 

cassava4.1_U19197m 


subtilase 


221.5 


2.9e-58 


PPUU24U 


cassava4.1_U 19468m 


SUM2 


117.1 


2.1e-27 


PPUU578 


cassava4.1_U17973m 


TPX1 


322.0 


1.9e-88 




cassava4.1_019139m 


T FR A1 A 


Z4 / .3 


4.je-oo 


PF07708 


cassava4.1_009650m 


UBQ2 


313.9 


2.6e-160 


PF11976 


cassava4.1_018181m 


UBQ2 


266.5 


6.5e-72 


PF00240 


cassava4.1_019139m 


UBQ3 


247.3 


4.6e-66 


UNKOWN 


cassava4 . 1 _0 1 5 690m 


Unkown 


287.0 


l.le-77 



and distributed in 2 protein families. The gene coding for 
those proteins are distributed in 30 scaffolds for the CPC and 
43 for the NCPC in the cassava GENOME. For the case of 
HSP, the genes coding for 10 HSP identified in CPC and in 
NCPC showed different distribution pattern, being 11 scaf 
folds for the CPC group and 15 scaffolds for the NCPC 
group. These results contribute to the overall observation 
about the functional complexity of HSPs in plants as well as 
the distribution and arrangement of their genes in the cassava 
genome. 



Gene Expression Analysis 

Expression of two genes, one related to chromoplast 
structure definition (Fibrillin gene) and another related to 
chromoplast development (Or-gene coding for Or-protein), 
were analyzed to gain insights on abundance and formation 
of chromoplast in CSR. Four identified gene sequences cod- 
ing for proteins in the CPC (HSP18.1, HSP17.6, HSP17.4 
and HSP21) were used to search for their possible roles in 
differential sequestration of p-carotene in white and intense 



Carotenoid-protein Complexes in Cassava Storage Root 



The Open Biochemistry Journal, 2012, Volume 6 127 




□ White "Yellow 



HSP18.1 HSP25.3 HSP17.4 HSP17.6 OrProtein Fibrillin 
Genes for Carotenoid Associated Proteins 

Fig. (5). Differential levels of transcript for genes coding for a set of proteins related to carotenoid accumulation in white (A) and intense 
yellow (B). Relative values expressed in relation to white roots. 





Dissociaton Temperature(°C) 



Dissociaton Temperature(°C) 



1770 "l 




Dissociaton Temperature(°C) 

Fig. (6). Dissociation curve derived from qRT-PCR amplification for 
present in CPC of white and intense yellow cassava storage root. 

yellow genotypes. Transcript levels for the above listed 
genes were more abundant in the intense yellow than in 
white CSR (Fig. 5) and followed the pattern of carotenoid 
accumulation. 

The dissociation temperature curve patterns shown in 
Fig. (6) of the qRT-PCR amplification of the four genes 
listed above indicates a change in nucleotide coding se- 
quence for HSP21 (Fig. 6D), which is associated with high 
amounts of carotenoid accumulation in the intense yellow 



420 n 




Dissociaton Temperature(°C) 
;enes coding for Fibrillin (A), Or-protein (B), CAP1 (C), CAP2 (D), 



landrace. The overall observation about the functional com- 
plexity of HSPs in plants as well as their distribution and 
arrangements of their genes in the cassava genome. Indeed, 
the sequence of the corresponding amplified fragment (Fig. 
7) revealed four single point mutations in the gene coding for 
this protein that may favor carotenoid sequestration in the 
CSR of intense yellow phenotype. 

In addition, these results corroborate those observed by 
protein blot analysis where Fibrillin- and Or-protein support 
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1. iac 12.829 (White) aggagataa£aaggacacatccgtggatgtgcaagtcaatc cbc agggtagc 

2. Guanamara (White) aggagataacaaggacacatccgtggatgtgcaagtcaatc c«c agggtagc 

3. surbin (Yellow) a g g a g a t a a c a a g g a c a c a t c c g t g g a t g t g c a a g t c a a t c c hc a g>g t a g c 

4. BGM019 (Yellow) A G G A G A T A A C A A G G A C A C A T C C G T G G A T G T G C A A G T C A A T C C «C A G«G T A G C 

60 70 80 90 100 

iiAC 12.829 (White) caaggcacagcagttgaaaaaagacc thggaggc agc tggcbg ttgatgtct 

2. Guanamara (White) C A A G G C A C A G C A G T T G A A A A A A G A C C THGG AGGC AGC TGGCBG TTG A TG TC T 

3. surbin (Yellow) caaggcacagcagttgaaaaaagacc thggaggc agc tggcbg ttgatgtct 

4. bgmoi9 (Yellow) CAAGGCACAGCAGTTGAAAAAAGACC tbgg aggc agc tggcbg ttg a tg tc t 

110 116 

1. iac 12.829 (White) CACCTTTCGGTT 

2. Guanamara (White) CACCTTTCGGTT 

3. Surbin (Yellow) 

4. BGM019 (Yellow) CACCTTTCGGTT 

Fig. (7). Sequence comparison for the amplicon resulting from qRT-PCR of HSP21 from white and intense yellow cassava storage root. 



a role for enhancement of protein carotenoid sink capacity 
(i.e. chromoplast formation and number) as well as a possi- 
ble specific role for HSP21 in the carotenoid sequestration 
mechanism. 

DISCUSSION 

Carotenoid-protein complex (CPC) formation in special- 
ized supramolecular structures inside the chromoplast has 
been associated with several molecular functional interac- 
tions [1,24], including sequestration of specific carotenoids 
that aid in driving the carotenoid synthesis pathway towards 
completion. The characterization of CPC protein composi- 
tion in chromoplast of cassava storage root (CSR) identified 
HSPs as the major type of proteins present in the complex. 
Further, specific markers for proteins (Fibrillin) used in this 
study indicate chromoplasts of intense yellow cassava stor- 
age root are of fibrillar type capable of chromoplast differen- 
tiation [25-27] and carotenoid accumulation [25]. Results 
from this study also suggest that a specific haplotype of 
HSP21 could explain the differential carotenoid accumula- 
tion in intense yellow cassava landraces. 

HSPs are classes of proteins that are unique, abundant 
and diverse in plants, especially in non-green tissue [28,29]. 
Higher plants have at least 20 small Heat Shock Proteins 
(sHSPs) and some species may have up to 40 different 
sHSPs [30]. The sHSPs are involved in response to environ- 
mental stress such as heat, cold, drought and salinity stress 
[31,32]. In the absence of environmental stress, sHSPs ex- 
pression in plants are restricted to certain stages of develop- 
ment such as embryogenesis, seed germination, pollen de- 
velopment, and fruit maturation [33,34]. In the case of CSR, 
we observed that the proteins forming the CPC in root chro- 
moplast are mainly composed of the sHSP family with vari- 
able levels of expression, depending on the type of carote- 
noid accumulating genotype. For instance, transcripts for 
HSP18.1 and HSP21 are more abundant in intense yellow 
cassava storage root where values of P-carotene accumula- 
tion were lOOx more than in white root. While HSP18.1 may 
be responsible for the enhancement of total carotenoid ac- 
cumulation, HSP21 may also play a role in the specificity to 
P-carotene accumulation throughout the sequestration 
mechanism as observed in tomato [35]. In addition, a new 
heteroduplex double strand cDNA sequence for the gene 
coding for HSP21 was observed, suggesting that an isoform 
of this protein is coded by a different gene haplotype pos- 
sesing four single point mutations in the sequence of the 
amplicon (Fig. 7) that may be responsible for the sequestra- 



tion of high P-carotene in intense yellow CSR. In other plant 
systems, the presence of a specific HSP21 protein, has been 
speculated to act as an antioxidant under oxidative stress 
[36] or as an enhancer of carotenoid accumulation in tomato 
fruit of a transgenic plant [35]. More in-depth research on the 
genotypes enriched with special carotenoids from other cas- 
sava landraces, will have a larger impact on carotenoid ge- 
netic improvement in new varieties and strengthen our 
knowledge of protein content enhancement and carotenoid 
regulation in non-green tissue. 

Fibrillin has been found in a number of plants and plays a 
structural role in carotenoid- storing fibrillar type chromo- 
plasts [37]. This prompted us to examine the presence of 
fibrillin in CSR, which could help define the type of ul- 
tramolecular structure of chromoplast present in CSR. Here 
we demonstrated that fibrillin antibody recognized a putative 
cassava fibrillin protein in CES of intense yellow but not in 
white cassava storage root. This evidence supports the pres- 
ence of carotenoid-fibril type in CSR, rather than other type 
of structures observed in roots such as carrot [18] that also 
store high amounts of carotenoids. Based on the discovery of 
Or-protein and its cellular function in inducing formation of 
chromoplasts [25,38], we used Or-protein antibody to recog- 
nize this protein in CSR as a possible indicator of the differ- 
ential number of chromoplasts between intense yellow and 
white CSR. Indeed, Or-protein was observed in CES of in- 
tense yellow but not in white root. This evidence indicates 
the potential for more chromoplasts present in intense yellow 
CSR creating a metabolic sink for carotenoid accumulation, 
similar to that observed in transgenic potato tuber over- 
expressing the Or-gene that accumulate greater levels caro- 
tenoids [39]. 

CONCLUSION 

In conclusion, this study provides evidence that the caro- 
tenoid-protein complex, in cassava storage root, is enriched 
with HSPs that interact with different carotenoids, depending 
on the genetic background of cassava landrace. The presence 
of Fibrillin- and Or-protein may confirm the fibrillar type of 
chromoplast and intense chromoplast formation in storage 
root of intense yellow compared to white cassava CSR. The 
novel allele of HSP21, identified in the present work, may be 
an important tool for genetic manipulation of protein content 
in cassava storage root contributing to a natural sink for 
protein and carotenoid accumulation in intense yellow CSR. 
Further elucidation of the roles of HSP21 in binding specifi- 
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cally with P-carotene or in inducing accumulation of carote- 
noid in CSR would provide a significant contribution to our 
understanding of the molecular machinery controlling caro- 
tenoid accumulation in non-green tissues. 
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